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Abstract 

We consider a signalling format where information is modulated via a superposition 
of independent data streams. Each data stream is formed by simple replication and per- 
mutation of encoded information bits. Interconnection between data bits and modulation 
symbols transmitted over the channel can be represented in the form of a sparse graph. 
The modulated streams are transmitted by one or several users with a time offset enabling 
spatial coupling of the sparse modulation graphs. We prove that a two-stage demodu- 
lation/decoding method, in which iterative demodulation based on symbol estimation 
and interference cancellation is followed by parallel error correction decoding, achieves 
capacity of the additive white Gaussian noise (AWGN) channel. 

1 Introduction 

Recently, a technique of spatial graph coupHng applied to iterative processing on graphs 
attracted significant interest in many areas of communications. The method was first intro- 
duced to construct low-density parity-check convolutional codes (LDPCCCs) [6] that exhibit 
a so called threshold saturation behaviour [12] [13]. It has been proved that the asymptotic 
limit (threshold) of the suboptimal iterative decoding of LDPCCCs achieves the optimum 
maximum a posteriori probability (MAP) decoding threshold [13] in a variety of channels 
asymptotically. The idea of constructing graph structures from connected identical copies 
of a single graph has since been utilized to improve the performance of iterative methods in 
compressed sensing [7], image recognition, quantum coding [10], multi-user detection [8] [9], 
and many other fields. 

In this work we consider a communication format in which a sequence of modulated 
symbols at the transmitter is formed as a sum of equal power redundant independent data 
streams. The format is inspired by the multi-user modulation methods [4] [5] permuting parts 
of the spread data sequences and has a few features common with multi-level coding [2], bit 
interleaved coded modulation [3], and repeat-accumulate codes [18]. A single data stream is 
formed by blocks of information bits that are first encoded for error correction, then rephcated 
a number of times, and finally permuted. The main feature discussed in the paper is the data 
stream coupling accomplished by addition of the data streams (in real or complex domain) 
with a time offset. The receiver needs to carry out two tasks: suppression of the inter-stream 
interference and error correction decoding. We prove that these tasks can be efficiently 
accomplished by iterative data stream layering and subsequent error correction decoding 
performed for all data streams in parallel. 



To find the data rate achieved by the proposed modulation format we study a system of 
equations describing the evolution of the noise-and-interference power experienced by each 
data stream throughout the detection iterations as a function of the iteration number and 
transmission time. We then prove that the achievable communication rate is within a small 
gap from the AWGN channel capacity and the gap disappears asymptotically as the signal- 
to-noise ratio (SNR) grows. This result is a counterpart of the result [11] showing that 
sparse graph modulation can achieve AWGN capacity with modified successive interference 
cancellation decoding. In this work we show that it is sufficient to perform demodulation and 
decoding sequentially. No feedback loop is required between the two processes. 

The paper is organized as follows. The transmission format and the two-stage demodu- 
lation/decoding are described in Section 2. Section 3 presents a performance analysis and 
the proof of the main result. Numerical results are discussed in Section 4. Finally Section 5 
concludes the paper. 



2 System Model 

We consider a modulation technique in which the signal transmitted over the channel is 
formed by a superposition of L independently modulated data streams. These streams may 
initiate at a single or multiple terminals and then superimpose at the receiver. 

A schematic diagram of the modulator for one data stream is given in Figure 1. First a bi- 
nary information sequence tx = tti, M2, tfs, • • • , uk enters a binary forward error correction en- 
coder of rate R = K/N. Then each bit of the encoded binary sequence v = vi, V2, v^, - ■ ■ , vn, 
where vj G {—1,1}, is replicated M times. Finally the replicated data is permuted. In this 
paper we consider equal power data streams although generalization to unequal power streams 
is straightforward. 
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Figure 1: Modulation of one data stream. 
The modulated signal x = (xi, X2, X3, • • • ) is computed as the sum 



Xt = ^Vt,l, (1) 



1=1 

where vt^i is the permuted and replicated data of the Ith modulated data stream. The sequence 
X is multiplied by the power normalizing amplitude y^P/L and transmitted over the channel. 
In this paper we consider transmission over a real-valued AWGN channel. Thus, the received 
signal equals 

fp 

y = y^x + n (2) 
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where n = (ni, n2, • • • ) is a noise vector with standard iid Gaussian components of zero mean 
and variance o"^. The power of the modulated signal xt^PjL equals P since xt is a sum (1) 
of L independent binary random variables attaining the values —1 and 1 with probability 
1/2. Without loss of generality we can assume that P = \. Therefore, the total signal-to- 
noise ratio (SNR) is normalized to l/o"^. The total transmit data rate equals RL/M = aR 
information bits per channel use. The ratio a = L/M \s called the modulation load. 

2.1 Spatial Coupling of the Data Streams 

In this paper we consider the case when L modulated data streams add up with an offset. 
Transmission of the first data stream starts at time t = 1. For the first T2 — 1 symbol time 
intervals the modulated signal consists of a single data stream. After a delay of T2 symbols 
transmission of the second data stream is also initiated. For time instances t G [r2, T2 +T3 — 1] 
the transmitted signal consists of a superposition (a sum) of two data streams. Then after 
a delay of rs symbols the third data stream is also added and so on. Finally at time t = m 
Lth data stream is added to the system. This process is illustrated in Figure 2 for the case 
of L = 5 streams and T2 = T3 = r4 = rs = 1 . In each data stream a transmitted block of MN 
data symbols is immediately followed by the next block. We call this process of transmission 
initialization stream coupling. 

For the case of coupled data streams 

Lit) 

Xt = ^ vt^i t<m. (3) 
1=1 

where L{t) < L denotes the number of data streams in the system at time t. We notice 
that the proposed modulation format allows for a graph representation in which variable 
nodes representing coded data bits Vj (before replication) are connected to channel nodes 
representing modulation symbols xt. Therefore, the addition of data streams with an offset 
relates to spatial coupling of graphs representing modulated data streams. 
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Figure 2: Coupled modulated data streams. 



2.2 Iterative Demodulation and Decoding 



The received signal y contains M replicas of each transmitted hit Vj, j = 1,2, ■ ■ ■ . Let T{j) 
denote the set of indices t such that the signal Xt, and therefore yt, contains vj. Moreover, by 
J7'(t) we denote a set of all indices / such that Vj/ is included in yt. For each bit vj we use a 
set of received signals {yt}teT{j) ^'^ form a vector y^. Since each yt, t G T{j) contains vj we 
have 

yj = hvj + (4) 



where h = ^/l/L{l,l,■■■ ,1). The vector = (^^-j^ , ^^-,^2 ,•• • , ^^-jj^^), where r(j) = (ti, ^2, •• • ,*m) 
is the noise-and-interference vector with respect to the signal vj . The components of the noise- 
and-interference vector are 



^ Vj, + nt. 



t e T{j) . 



The vector can be approximated a Gaussian random vector (due to central limit theorem) 
with independent zero-mean components and covariance matrix Rj = diag((7jj ,a^^, - ■ ■ , af^ ) 
where at denotes the variance of ^j^f For t > m the cardinality of the set ^{t) is \J{t)\ = 
L giving the variance = {L — 1)/L + 1. However, since |i7'(t)| = L{t) < L for some 
t < m and since the variances are influenced by the interference cancellation throughout the 
demodulation iterations we prefer to write a general expression based on erf. 

We now perform minimum mean squared error (MMSE) filtering on y^ to form an estimate 
of Vj with the highest possible SNR. The output of this MMSE filter is 



Zj = Wjyj = Wjhvj + Wj$j , 



(5) 



where 



Wj = (I + h*Rj^h)-^h*Rj^ 



satisfies the minimization condition \ \wjyj — Vj\\ min. The resulting SNR of the signal Zj 
equals 

jj = h*R-'h = ^ E ^ • (6) 

ter(i) * 

Since Vj G {1,-1}, and takes each of the two values with probability 1/2, we can form a 
conditional expectation estimate Vj of Vj as 



Vj = K{vj\zj) = tanh(2;j7j) = tanh 



-y - 

'L ^ al 



Once the estimates Vj are generated for all data bits Vj, j = 1,2, 
starts with an interference cancellation step performed by computing 



y 



(1) 



hvj + 



(1) 



where h = y/l/L{l, 1, • • • ,1) and 



{vj' - Vji) + nt 



(7) 

the next iteration 
(8) 

(9) 
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We then proceed with calculating the new estimates Vj according to (5) and (7). 

The estimation and interference cancellation process is repeated for a number of iterations. 

(i) 

To avoid reusing the same information throughout the iterations we compute M vectors [ , 
t G T{j) for each bit Vj at each iteration i. Vector y^*] is composed of 

yS' = \ll-^ + \ll E (10) 

where t' E T{j),t' ^ t. The vectors y'^^\ are used to form M estimates zj'] according to (5) 

and corresponding vj*] estimates according to (7). 

We consider a two-stage decoding schedule. At the first stage / iterative demodulation 
iterations described above are performed. The second stage comprises simultaneous decoding 
of the forward error correction codes used to encode the information in the data streams prior 
to the modulation. 



3 Performance Analysis 

To simplify the derivation of the equations describing the evolution of the noise-and-interference 
power throughout the demodulation iterations we assume that each modulated data block of 
MN symbols consists of 2W + 1 equal length subblocks. Assume that the time t is measured 
in subblocks. We also assume that L data streams are split to 2W + 1 equal size groups with 
L/ {2W + 1) data streams in each group. The transmission of the first group of streams starts 
at t = 1, the transmission of the second group starts at t = 2 and so on. This situation is 
illustrated in Figure 2 for the case of = 2. 

The noise-and-interference power at iteration i and time t is given by 



2W + 



1 ^ 

— F 
' + 1 ^ 



1 1 



5m 



j=-W 



w ^ 



a2W + ^ — X 

i=-w -^i-i 



t+j+i 



(11) 



where the function 5'm(')i introduced in [19], is given by 



5m(a) = E (l-tanh(a + C^/a))^ , ^ ~ AA(0, 1 



and A/'(0, 1) denotes a standard normal random variable. The recursion (11) was derived 
in [1] for a multi-user system in which data estimation and detection as well as interference 
cancellation are also described by (7) and (10). The total system power is normalized to 1 
(i.e. individual data stream powers equal 1/a) and the noise power equals o"^. 

We assume that transmission starts at time t = 1, i.e., the load of the system increases 
gradually. At every time instant t, the modulation load is increased by a/{2W + 1) for 
t = 1,2, ■ ■ ■ , 2W + 1. As a result, the initial condition for recursion (11) can be formulated as 



= t < 



Xq 



t 



2W + 1 



+ Cj2; te[i,2W + i] 



l + a^ t>2W + 1 



(12) 
(13) 
(14) 
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Following / demodulation iterations the residual noise-and-interference power of a data 
bit in a block transmitted at time t equals The error correction decoding performed at 
the second stage of the two-stage reception process is successful iff 

^>e, vi (15) 

where 6 is the decoding threshold of the forward error correction code. The convergence of 
the SNR propagates along the received data stream with some speed s. Thus for each t the 
number of iterations required to achieve a specific SNR level needs to be I{t) = st, where s 
is a speed of convergence. 

3.1 Roots of the Regular Convergence Equation 

The convergence equation in the regular (uncoupled) case is 

+ a^. (16) 

axi-i ' 

In the uncoupled case when all the data streams are transmitted simultaneously with no time 
offset the system load equals a at any time t and all the data bits have the same SNR equal 
at iteration i irrespective of the time when the bits were transmitted. The corresponding 
equation 



x = gm[ — ]+'J^ (17) 



ax 



where x £ [0, oo) can have one, two, or three roots. We focus on the case when (17) has 
three roots x^^^ < x*-^^ < x^^^ and the uncoupled recursion (16) converges to the largest root 
x(^) while the coupled recursion (11) converges to the smallest root x^^^ of (16). As we will 
see later (Lemma 1) x(i) « while x^^^ > 0.3 which means that at high SNR the coupled 
system may operate near channel capacity (Theorem 1) while the uncoupled system is deemed 
to have very low rate. 

Note that the roots of (17) are functions of a and o"^. For o"^ > 0.07 we can define a 
threshold load value as(c7^) such that for a G [0, as((T^)) (17) has only one root, for a = Qs(cr^) 
(17) has two roots, and for a G (as(<7^),a] (17) has three roots, where 

a = a(a2) = l/21og2(l + l/cr^) . (18) 

Figure 3 shows the function 5m (^) + o"^ plotted for = 0.01 and three different values of 
a: a = 1.5 (magenta curve), a = 2.0115 = as(O.Ol) (blue curve) and a = 2.5 (green curve). 
The roots x^^\ x^'^\ and x^^^ are indicated for the case a = 2.5. Lemma 1 states upper and 
lower bounds on the roots x*-^-* and x^^^ which will be used in the proof of the main result. 

Lemma 1. (a) For < 1 and a G [0, a] 



(b) For < 0.01 and a G [4, a] 



< < (1 + e-l/^)^2 < _ (^g) 



1 + - - < x(^) < 1 + cr^ (20) 
a 
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Figure 3: Plot of gja{^/{xa)) + o"^ for o"^ = 0.01 and a = 1.5 (magenta curve), a = 2.0115 
as(cr^) (blue curve) and a = 2.5 (green curve). 



X 



(c) For < 0.01 

(d) For < 0.01 anda>2 

Proof. See Appendix A. 
Consider a recursion 



(3) = 1 + C72 



a(l + cr^) ~'~ \a 



0.3 < x^^) < 1 + CT^ 



a(xi_i + x(i)) 



,(1) 



(21) 

(22) 

□ 

(23) 

a,(T^), the smallest 



equivalent to (16) with respect to the shift of the roots by x^^^ 

root of (16). The roots of (23) equal 0, x^^-* —x^^\x^^'^ — x^^'^ where x^^\x^'^\ and x^^^ are the 
roots of (16). Here by equivalence we mean that the two recursions always converge to the 
values which differ by x^^\a,a'^). This can be easily shown by a variable exchange. 

A coupled recursion corresponding to (23) can be constructed in a form similar to (11). 
The two coupled recursions are also equivalent from convergence perspective and the resulting 
sequences differ by x^^\a,a'^). 



7 



3.2 Spectral Efficiency and Capacity 

System (11) operates with total signal power 1 and noise power cr^. The capacity of the 
(real- valued) AWGN channel for this parameters equals 

Cawgn(ct') = ^log2 + . (24) 

Assuming that the system (11) converges after limiting (infinite) number of iterations to 
the smallest root x^^^ of (16) the achieved spectral efficiency is 

Ceff(a,cr^) = oCbiawgn ' (^^) 

where Cbiawgn(-) is the capacity of the binary- input AWGN (BIAWGN) channel as a func- 
tion of the signal-to- noise ratio (SNR). Here we assume that the underlying forward error 
correction codes used for each data stream are optimal for the BIAWGN channel with SNR 

and therefore have rate R = Cbiawgn (^^)- 
We are now ready to state the main result of the paper. 

Theorem 1. 



as C7^ — 7- (and therefore CAWGNi^r'^) oo). By a* we denote the limiting modulation load 
value corresponding to a"^ . 

3.3 Proof of the Theorem 1 

To prove the main theorem we derive a the convergence condition of the coupled system using 
a technique to study scalar recursions [17] [15]. We then compute the achievable spectral 
efficiency (communication rate) and prove that it is within a small gap from the channel 
capacity. 

We define two functions 

/(X, «) = 5m - x) ) + - , (27) 

9{x) 4) - ^iT— ' (28) 
selected to ensure that the recursion 

ooi = f{g{xi^i),a) = 5m ( ^—ns) + - ^^'^ (29) 

\Xi_i +x(^) J 

coincides with (23). 

We consider < 1 and for each value of cr^ we consider a S [0,0;], where a is defined in 
(18). The following lemma states the properties of the functions f{x, a) and g{x). 

Lemma 2. For x G [0, 1] and a £ [0, a] 
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(a) /(O, a) = and /(x, 0) = 0, 

(h) f{x,a) is an increasing function of x, 

(c) g{0) = and g{x) is an increasing function of x, 

(d) f{g{x),a) is an increasing function of a. 

Proof See Appendix B and Appendix C. □ 
We now compute the potential function of the system (29) in the form 

def 

U{x,a) = / (z - f{g{z),a)) g'{z)dz 







X -\- ' (7 X I 

Lemma 2 (c) and (d) imply that U{x, a) is a non-increasing function of a for any x G [0, 1]. 

The method of potential functions [17] implies that a coupled version of system (23) 
converges to all-zero solution for a < a* satisfying 



a* = sup < a : min U(x, a) > 
[ xe[o,i] 

when W is chosen to be sufficiently large. The function U{x,a) is plotted for cj2 = 0.0129, 
0.003347, and 0.000855 and the corresponding a*{a^) = 3,4,5 in Fi gure 4. Note that a* = 
Q*(cr^) is a decreasing function of o"^. 

To find the minimum of U{x,a) for given a (and a"^) we compute its partial derivative 
with respect to x 



dU{x,a) 1 f 1 \ 1 



(31) 



dx x + x(^) (x-|-x(i))2 \a{x + xW)J (x-|-x(i))2 

The condition for extremum 

dU{x,a) ^ ^ 
dx 

is equivalent to 

x + xW =9^ (— ) +^2 . (33) 



(32) 



a{x + x(-^)) 

Depending on the value of a we can consider there cases (see Section 3.1) 

1. In case a G [0, a^) equation (33) has a single root x = corresponding to the minimum 
of U{x, a). This implies U{x, a) > for x G [0, 1]. 

2. In case a = as equation (33) has two roots: x = corresponding to the minimum and 
X = x^'^^ — x^^^ corresponding to a maximum of U{x, a). Again this implies U (x, a) > 
for X G [0, 1]. 
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Figure 4: Plot of U{x, a) for a = 3 and = 0.0129 (blue curve), a = 4 and = 0.003347 
(magenta curve), and a = 5 and o"^ = 0.000855 (black curve). 



3. Case a G (a^, a) corresponds to the case when (33) has three roots. U{x^ a) has two 
local minimums and the second minimum, located at the point Xm = x^^^ — can be 
negative for large a (the first local minimum is at x = 0). 

Therefore the value of x for which U{x,a*) = equals Xm = x^^^ — x^^^ and corresponds to 
Case 3. Computing C/(xm,a) gives 



U{x,a,a) = U{x^ ' - x^ \a) = \n-^ ^^^^^^3^ a -a / gra{y)dy . (34) 



1 



To find a* we consider equality 

C/(x(2) -x(^\a) = (35) 

which is equivalent to 



a;(3) 3,(3) _ ^(1) ""y^" 

xa)x(3) / ^-^2/)rfy = o (36) 



1 



.(3) 



a;(3) 2;(3) _ ^(1) 



1" ^ - ^(1)^(3) ^ - a2 In 2 ( Cbiawgn ( ^ 1 - Cbiawgn ( ^ ) ) = (37) 
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where the last equahty of the left hand side is due a relationship between MMSE (i.e. function 
5m(")) the mutual information for BIAWGN channel, derived in [14]. Continuing with 
dividing (37) by 2 In 2 we define a function 

.defl, 2;(3)(a,o-2) _ 1 x(3)(q,cj2) - 2;(^)(a,a2) ^ 

- « (cbiawgn [^^m\a,a^)) " ^^^^^^^ (ax(3)(a, a^)) ) ^^^^ 

where for fixed cj^ 

u{a) > for a G (Qs,a*) (39) 
u{a*) = . (40) 

Now we estimate a* using (38), (39), and (40). Applying the bounds on x^^^ and x^^^ given 
in Lemma 1 to some parts of the expression (38) we obtain 

, , 1 x(3) 1 + ^2-^2 

-2 (l + e-V^)a2 - 2hr2 ^2^(3) 

- a (cbiawgn + " ^biawgn (^(TT^) ) ' ^^^^ 

The next step is an application of the bounds on the capacity of BIAWGN channel 

2 

< Cbiawgn (7) < 1 • (42) 

The lower bound in (42) derived in [20] [16] is valid for 7 < 1 and we apply it to the last term 
in (41) 

(a(lla2)) - 21n2a(\ + a2) " 2 In 2a2(\ + ^2)2 ^^^^ 
since a(l + a"^) > 1 for a > as > I. The upper bound of (42) is used to derive 

Using the BIAWGN channel capacity bounds (43) and (44) in (41) we obtain 

1 x(3) 1 1 1 

u(a) _ 2 log2 ^ 21n2x(3) " ^ 2 In 2(1 + C72) 21n2a(l + a2)2 ^^^^ 

1 , 1 + 0-2 ^ ^(3) I I 

= 2 l°g2 ^ - « + 2 l°g2 + + - 21n2 xM 

1 1 

^ 2 In 2(1 + cj2) " 21n2a(l + ^2)2 ^ ^ 

= 7; log2 ^"-^^ - a + C(a) = a - a + ^(a) (47) 

where 

. ^ drf 1 a;(3) 1 l_ 1 1 . . 

Uaj 2 (l + e-i/-)(l + a2) 2 In 2 x(3) ^ 2 In 2(1 + ^2) 21n2a(l + ^2)2 ' ^ 
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Applying the asymptotic expression (c) of Lemma 1 we obtain an asymptotic expression for 



e(a) 



1 



2 In 2 



In 



a(l+o-2) 



(l + e-V-)(l + a2) 



+ 



ln(l + e-^/") 1 
21n2 21n2 



' 1 — 2W + ° 



+ 



a(l+(T2) 



q(1+0-2) 

ln(l + e-V<^) 1 

21n2 
ln(l + e-i/"^^ 



y2\2 



21n2 



+ 



2 In 2 
1 

2 In 2 



+ o 



a{l + a^)^ \aj a{l + a"^)^ ail + a^Y 



a(l + a2)2 a(l + cj2)^ 



+ 



Finally a combination of (47) and (52) implies 



C 



u{a) > a — a + ^(a) = a — a + B -\ \- o 



a 



where 



B 
C 



ln(l + e-^/'") 



21n2 



21n2 



+ 



(l + a2)2 (l + cj2)3 



Now we consider equation 



C 

a- a + B + — = {) 
a 



and its largest root 



a + B 



1 + W1 + 



4C 



a + B + 



C 



+ 



{a + BY \a 



- V V i^ + B?^ 
Since a* is the root of u{a) and u{a) > for a < a* from (53) we obtain 

a* = Or + o 

Using the fact that Cawgn(o'^) = a (see Section 3.2) and taking into account 

B = o 
C 



2 In 2 



a 



(49) 



(50) 

(51) 
(52) 

(53) 

(54) 
(55) 



(56) 



(57) 
(58) 
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we find that 



1 



a* = Cawgn(^') - ^ + o ] . (59) 



as (T — )• and Cawgn(o' ) oo. The achievable spectral efficiency (see Section 3.2) equals 

Ceff(«*,0-^) = Q*CbIAWGN • (60) 

From (38) and (40) we obtain 

1 x(^)(a%a^)-xW(a-,a^) o / 1 \ .... 

"21n2 xW(a*,aV=^)(a*,a2) " " " "^^^^^^ 

(a*, cr^) / 

= a + e(a*) (62) 

where the last equality comes from (45)-(47). Combing the above with (52) and (59) ffiially 
gives the main result 



L-AWGN(<7^J V'-AWGN('7 j 
Theorem 1 is proved. 

4 Numerical Results and Discussion 

The asymptotic expression in Theorem 1 demonstrates the asymptotic behaviour of the spec- 
tral efficiency and shows that the gap between the spectral efficiency and the AWGN channel 
capacity tends to zero as SNR increases. For small SNR we can obtain limiting system loads 
a* numerically. The gap between the achievable spectral efficiency and the AWGN channel 
capacity is plotted in Figure 5. We can observe that the gap is decreasing as the SNR and 
the channel capacity grows. 

5 Conclusion 

We consider modulation of information in a form a superposition of independent equal power 
and rate data streams. Each streams is formed by repetition and permutation of data and the 
streams are added up with an offset initiating the effect of "stream coupling" . We prove that 
the proposed system used with iterative demodulation followed by error correction deciding 
at the receiver achieves capacity of the AWGN channel asymptotically. 
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A Proof of Lemma 1 

We start with deriving the bound on x^'^\ The lower bound is trivial since gm{y) > 0, for 
y e (0, cxd) implies 

To derive an upper bound on x*-^^ we notice that 



(7^ <9m{^] ■ (64) 



2a^ >5m ( ]+(T^ . (65) 



which again follows from the fact that gmi') in non- negative and show that 

1 

To prove (65) we start with 

which follows from an upper bound on gm{-) derived in [19]. Continuing upper bounding Q{-) 
and using a < d we obtain 
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x(^) < ttQ I / ]+(J^ < iT^=Va2a^ exp ( ^)+c7^ (67) 



1 1 ^ ^ 1 - - 2m \ , 2 



exp( -^ + -ln- + -ln(2a^a) ) +c7^ (68) 
exp i , 1 , ^ + ^ In J + In (2^2-1- ln(l + ) 1 + (69) 



^ a^^lnil + l/a^) 2 2 2 \ 21n2 
< exp ^-^ + 2 ln(<T)^ + cr^ < 2o-2 . (70) 

The first inequality in (70) is valud for cj^ < 1 and is due to the fact that the term 
dominates the exponent in (69) as ci^ — )■ 0. The bound (65) is proved. 

Bounds (64) and (65) imply existence of the root cr^ < x^^^ < 2a^ for any a G [0, a] when 

<1. The root x^^-* then satisfies (70). Lemma 1 (a) is proved. 

The upper bound on x^^^ is again trivial since gm{y) < 1 for y G [0, oo] and therefore 

X^^^ = 9m ( — W ) + < 1 + ^2 . 



To prove the lower bound we use the bound 

9nM >i-y (71) 

valid for y G [0, oo], derived in [19]. Inequality (71) implies that the largest root of the 
equation 

x = l h cr^ 

ax 

is a lower bound on x^^\ Therefore 



1 + c^V-, , , 4 \ . , . 2 2 



> 1 + 1 - — ^ > 1 + - - . (73) 

2 V a(l + <T2)2y a ^ ^ 

Since the value inside the square root in (72) needs to be non-negative the bound can be used 
for 

4 

(1 + C72)2 ' 

i.e., essentially for a > 4. Lemma 1 (b) is proved. 
Applying asymptotic expression 

9m{y) = l- y + o{y) for y ^ 

obtained in [19] we find that for a — )• oo and x < 1 + 

5m( — ) +a'^ = 1- — +a'^ + o(- 
\ax J ax \a 
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This implies that (72) gives an asymptotic expression 

x(=^) = 1 + ct2-— ^ + ofl) . (74) 
a(l + a'^) \a J 

Lemma 1 (c) is proved. 

To derive a bound useful for a < 4 we start with showing that as(cT^) is a decreasing 
function of a^. We observe that as('7^) can alternatively be defined as a value of a satisfying 

1 / 1 \ 1 2 

max -flfni — + -o- = 1 , (75) 



where the maximum is achieved at x^'^\as{cr'^) , cr'^) = x^^\as{cr'^) , cr"^) ■ Clearly, for any o"^ < 

1 / 1 \ 

x(^)iasia^),a^f'^{ax(^){a,ia^),a^)) ^ x(3)(as(c72), ^2) ^ 

and therefore Qs((t2) > as(o"2) and the corresponding x^^^a"^) will bring the maximum to 1. 
Since as(o"2) is a decreasing function of cj^ then for any o"^ > 

asia"^) < as(0) « 2.0756 . (76) 

Our second step is to show that as{a'^)x^^^ where x^^^ is the root corresponding to as{cr'^) 
is also a decreasing function of cr^. By a variable exchange y = l/{ax) in equality (75) we 
transform it to an equivalent form 

- (yr9m(yr) + VvCT^) = 1 (77) 
Q 

where is a root of 

^ ^ {ygm (y) + ycr^) = ^ {yg^ (y) + 9m {y) + cj^) = o (78) 

corresponding to the local maximum of s{y) = yg^a (y) + yo"^. Function s{y) has a single 
extremum maximum for < 0.01. Function h{y) = yg^n (y) (blue curve) and its derivative 
(magenta curve) are plotted in Figure 6. It is easy to notice that the value y^ corresponding to 
the maximum of s{y) is an increasing function of a"^ and therefore as{(T'^)x^^^ = 1/yr decreases 
with cr^. Thus for arbitrary cr^ < 0.01 

as{a^)x^^\a^) > as(0.01)2;(3)(0.01) (79) 

which leads to 



x^'\a') > ^as(0.01)x(3)(0.01) > ^^x^'Hor^ '-^''^ 



^ 01) = ——0.3180 = 0.3082 , (80) 

Qs(cr^j as(0) 2.U756 

with an application of (76). Lemma 1 (d) is proved. 
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yg,(y) 
ci(yg Jy))/ciy 




-0.2 



Figure 6: Plot of ygm(y) (blue curve) and its derivative (magenta curve) 

B Proof of Lemma 2, (a)— (c) 

We start with proving property (a). The first statement 

/(0,a) =5: 



ax 



(1) 



(81) 



follows from the definition of x^^^ which is a root of (16). To prove the second statement we 
note that for a = equation (16) transforms into 



X = a 

As a result x^^'^ = and therefore for a = 



/(x,0) = lim om ( - ( -7TY 



x \ \ ^ a — x 



(1) 



a"^ — X 



(1) = . 



(82) 



(83) 



Property (b) can be easily verified since (7m(") is a strictly decreasing [19] function and 
therefore 



f{x,a) = gra[^i - X ] ] + cr^ - x^^^ 



(84) 



is strictly increasing for fixed a (in which case x^^^ is also fixed). 

The proof of property (c) is straightforward from the definition of the function g{-). The 
proof of property (d) is given in Appendix C. □ 

C Proof of Lemma 2, (d) 

The proof is split in a number of propositions. 
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Proposition 1. For any cr^ < 0.005 there exists e > such for any a G (0,a] and < 6 < e 
function 

rii{x,a,6) =^ gm( —-^—^] - gm( —] (85) 
\a[x + 0) J \ax J 

satisfies 

r]i{x, a, 6) > r?i(A, a, 6) (86) 
for any A € [cr^, 2cr^] and x G (A, 1 + 2fT^] . 

Proof. The derivative of the function ?/i(x, a, 6) with respect to 6 taken at 5 = equals 



dr]i{x, a, 6) 



85 

where y = 1/ {ax) . Function 



5=0 



-5m ( — ) ^ = -5m (y) y^oi , (87) 
\ ax J ax^ 



is plotted in Figure 7. Function f (y) is non-negative, has a single extremum, which is a 
maximum, at the point ym ~ 3.12, and satisfies 

v{0) = lim t;(y) = . 

y^oo 




Figure 7: Plot of function gm{y) (blue curve), — 5m(y)2/^ (magenta curve), g'^{y) (red curve), 
and —g[„{y)y (black curve). 



First we show that for any a G (0, a] 
v{y) > V 



y G 



1.02 1 



a aA 



(89) 
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We consider the following two cases. 



1. Case a < 1.02/ym 0.327. 

Statement (89) holds since v{y) is decreasing for y > y^. 

2. Case a > 1.02/y„ ^ 0.327. 

We note that -g'^^iy) G [0.06, 1] for y £ [0 

jUm]- Function g'^iy) is plotted in Figure 7 (red 

curve). Therefore 

t;(y)>0.06y2 ye[0,yj 

which implies 



1 

min min viy) > 0.06^ — , , -, 

(0.51og2(l + ^))2 



0.06 



To bound f(y) for large y we use an expression for g[j^{y) derived in [19]. 



viy) = y 



27r 

< 2y^e-y/^ 



TT^ 7r3(57r2-48) 241z 
(1 + y)7r - — + ^ + 



128y 



\z\ < 1 



where the last inequality is valid for y > 5. Since v{y) is decreasing for y G [t/m, ^ 



min v{y) = v 



1 



1 

a2a2 



for any a > 1.02/ym. It follows from (93) and (92) that 



2a2log2(l + ^ 



(90) 

(91) 
(92) 

(93) 



1 



< 



1 



aAj ^ 2a4logi(l + ^) ''''^ \^ 4a2log2(l + ^ 



(94) 



Finally we observe that for cx^ < 0.005 the right hand side of (90) is greater than the left 
hand side of (94). Thus v{y) attains its minimum at the point This proves (89). 
Using Taylor series expansion for r]i{x, a, 6) we obtain 



rji{x, a,6) = 6 



dr]i{x, a, 6) 



86 



1 .2 8'^r]i{x,a,6) 
^ 2 85^ 



6a ( V 



1 

ax 



6 , f 1 

+ -V —, =r 

2 \a(x^6) 



(95) 
(96) 



where < 6 < 6. Since 



|t;'(y)|<0.5 yG[0,oo) (97) 
then for sufficiently small 6 (independent of a) (96) together with (89) implies (99). □ 

Proposition 2. For any < 0.005 there exists an e > such for any a such that a > 
a + e > a > and < 6 < e function 



, . def 

r]2{x,a,6) = gm 



1 



[a + 6)x 



gn 



1 

ax 



(98) 
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satisfies 

■n2{x,a,6) > r]2{A,a,6) (99) 
for any A e [fi^, 2cj^] and x £ (A, 1 + 2a'^] . 

Proof. The proof follows the lines of the proof of Proposition 1. It focuses on an analysis of 
a function V2{y) = —g'^{y)y (plotted in Figure 7, black curve) which is a scaled derivative of 
772(2;, a, (5). □ 

Proposition 3. There exists an e > which only depends on cr^ such for any < 5 < e 
function 



Xa + 6){y + xW{a + 5,a'^))) ^"^ \a{y + x^^){ a,(T2))_ 

-x(^)(a + (5,CT2) + x(^)(a,cj2) > (100) 

for ye (0, 1 + fj^ - x(^) (a, cj^)] and q;>q; + 5>q;>0. 
Proof. Lemma 1 (a) implies 

2(7^ > x(^) {a + 6,a^)> x^^^ (a, a^) > a\ 
for (T^ < 1. For any given 6 we define 

62 = x'^^\a + 6,a^)-x^'\a,a') 

and select a sufficiently small e > O such that the conditions of Proposition 1 and Proposition 2 
are satisfied simultaneously. We notice that 



(/)(y, a, S) = gm{ ,^ , , ^^-1^,^ , ^ , ) - ft 

+ 5: 



a + 5)(y + (q, ^2) + ^2) ) ^ V (a + '5)(y + (a, ^2)) , 

(a + 5)(y + x(i)(a,a2))) " ^"^ (0(2/ + x(i)(a,a2)), 
-xW(a + 5,a2) + xW(a,cj2) (101) 

r/i (y + x'^"^^(a,(T^),a + (5, +^2 ( y + x'^^^(a, a^), a, 5 



- (a + 5, ^2) + (a, cj2) . (102) 
Proposition 1 and Proposition 2 imply 

</.(y,Q,<5) >0(O,a,(5) (103) 



,(a + (5)(y + xW(a + <5,a2))y ^"^^ \^a(y + x(i)( 

a, 0-2)), 

-xW(a + 5,(72) +xW(Q,a2) (104) 
= (105) 

which finally leads to (100). □ 

Proposition 3 implies that f{g{x),a + S) is an increasing function of 5 for any a such that 
a>a + 6>a>0 and < 6 < ^{cr'^)- Thus, f{g{x),a + 6) is an increasing function for 
a G [0, a]. Statement (d) of Lemma 2 is proved. 
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